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AB - PROBLEM TO BE SOLVED: To optimize both velocity and efficiency of a 

Java optimization device and a JIT compiler by successively executing 
the forward and backward scanning of a byte code block and generating 
a link to a preceding or subsequent node in a file. 

- SOLUTION: Forward scanning of a byte code source file 102 is executed 
in order to decide respectively the start codes 106, 108 and 110 of 
byte codes 112, 114 and 116 in a block that is not interrupted (100) . 
Then, backward scanning of the file 102 is executed. In the code 116 
is an instruction POP the POP is read and a node 120 is generated in a 
data flow graph including a POP instruction 122. Successively, a value 
2 is read by the code 108 and linked to the instruction 122 (118) 
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K^y^^s-ea^ffittJ^Tox-f^^e&a :±is 
c£<Dttt»'W ha- f^o? **arii*iniccx*+> 

; ±fa«t»Bf3*i-5C4oacw%v ha- F:/n? $ 
rtCCfct^r, _bffi^^-f ha-FCDS**'*^ hn-F* 
firfSii^tc^+i->-r^ ; te<fci?±IE>*-f ha- F 
<D&*«cioTffiffl$*iS±IE'W ha- K(Dffe(D^r 
CC*for, ±ffi'^ ha- F<D-&**y>£S-£4. ± 

WSftStiOtcC^W ha- F^a ? ^BBtVW ha 

7^tfcflS«±fBtl3BfSnsci(D3Q:lw<^ ha-Fr^a 

*Jl»r % *Sffi«r*Jffi-rS*^-Y ha-Fi±IB*»rS 
-C±ISffl^^*r^S^V ha - FlfflCCiE^-ir^O 

!S»©s»o5 s> 3 a >r*s«rr 431 1 osssiffl 

l/T#ii*r£ ; fcJ:tf*-5ffi*£j5J»-<5£^-f h a- F 
<Dtctb<D±.uzm 1 <D«8SJ!y?f<Of6(D^"C(DSBCCfet,i 

r, ±iam i ofisgrtccfcws±E*»rsnac 

ts^a * ^<D1 o(c*Sffl4^jS-rS-S^V ha- Fi 

isms©** v ^(aa^w-r-ss^-Y ha- f<d/c»©u 

[0 0 0 1 ] 

[»W<7?IST5t»B^if] ##693 (Java) 

Wtt'^ ha- F<D^n ? £rt*5<fciy^ ha- K<D^ 

*7 r J )l&5&it^fr%0'T-~$ya--#~ry<DTA'< 
9 b (aspect) <D£.mcM~?2> 0 
[0 00 2] 

[Se*0iJ8f] 77 h ^xy^^fc<££»'a>t: 
(Da-1fK:io"C»l<E>ti-Cl>5 1 -3©l8liH«: % ^^b- 
i/a^i/^fAO^^-; h 7 * — ARBCCfcW S V 7 h^ 
xT©pI»14<D£&n-C*£. C(DBBjHCCffiiW/ci5C*« t 
7-Ji/F7-f F£*:ADftq#IKSS-rs*7 h 7-#ecfe 

iT'OStltc-fJv F7t-A(C3fci^f7'y i ^ hfg 
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ft<Da>fc: £l-f§<t OT, J AVA7'P^7 A#|g 
> v ^ 47 a is* t A J: o r B9% S ftfc. 

[0003] ^t^u *>'*4)^&twsRms5<Dm 
#r & £ c t J: ^ r ^ u - 3 > f* a ec*w s 

rt^teS^ + ^v — ^a— F#ix*^vs^ ha— Fifl* 
tthSiHra*Bl7*-Vy htCa>y\-^;U$n^ 0 y + 
rcr^f ha-Fi-nV Ft7'3- F (opcode) 
©^-4r>;ucJ;oTflte5$ft. S*^a- FttUff S 

7'a- FO»-^3&H*^vy-**Wr5. al-^a-FS 
■§■2 1 iload<varnu m>\t^vJx?£&Lv a r 
n u mfttCfettSnfc*— V- F<DB»ffi«rSX>3 Wit 

[0 0 0 4] ;Wha-Flt ^-wnVn'-^ ;U 

vi^— > (Java Virtual Machine 

JVM) CCJ; or ASSIST ha- F#fflW(D 
(n a t i v e ) -^>->a- FfcfflSRShS. J VM 

20 "{g*T (Dy'U-^yVO^^ y ^fc^-X<Dft#ffrC* 
J VMtc<torH?fS4iS/<^ ha- F«#Wfl«Jtc 

7F7*-A*>U< tt^ 3 >5/^ f Alt 5^ + 

^7>^A^fA(Java Runtime S 
ys t em) iP?«ti*. J VM^B^coaWfc^S* 
*tSCi*ir#, ha- FPfffib 

*THOH*^^-f-Atc;u- h{=t^-rs 0 
30 [0 0 0 5 ] 

[^*s(BKL<t5it5SH] JAVA^-Zha-F 

cms* b^zisBisiz v h*Ba#6-rstc^fcor. s 
stttt, ^- F^xroftfflffccct-sr+^icffi^r-* 
SCiiittc, S*ttffia*#x, ^iS^3tC7^--v 
v h Stifc^^-f ha- FfCifSHTSUffx^— PzsXt 

*T€)Ci^rS^$n/c 0 t^ + ^v^ ha- KttfSS 
(significant type) ©Wffi*£tf© 
r\ 1^SE*I3«. ha- F*W>*-* v h^Hff 

(extensive type) CD^-x ^ z^^j-f 

^-^U-^tS, H?fB#tC«/jN|B<D (minimal 
type) ^x ^ ZZmtf -?Z><Dfr-CJ:l,\ iZMMUffi 

all Ta 1 k<Dan*SSS±«:gft9. ^ + ^^«fSff 

[0006] y + ^ # tt^gi^iacc j: or^tt^fsfi 

U ^'W ha- F(C<t ^r njffltt (portab i I i 
50 ty) 44x5^. ^ + /<7"ay-7AtJC/C+ +<DftD 
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(Web Page) ±r^ + '< 7*p ASriLbtffc 

- F«D^CC^fb*»AT*C CCDS® 
{fctt. FTa->ft»{bg (S t an d - A 1 o 

ne Optimizer S AO) <t LT , SEt»ttS/ 
tXF>f>^A (J us t - i n-Time J I 10 

t) 3>^<^5<D-aBtLr$tf«<r©3!a«rA*{kr 

[0 00 7] SAO(^Vh3-K^tfA*^7X7 

h 3 - F *^t? A* ^7X77-/ ;u*Hff njfiEtt ^ 
^Afcg8W<&. J I T<OH«««rCB. JVMWo 
y5Art©^t(D^W ha- F-T h7*i/3>S^ 
fv^ftiai/rSttRWCC*^^-/ ^3- F3-;U^rHtf 
It^fc. 0#>U J I T3>;W^£ffiC>£i§£\ J 20 
VMttS-TJ I TKWt^tffflU**^ J ITtt^> 
* h 7^^3^^^-f 7*v 7*^ct^U-^3>VXf A 

S„ J IT^>^V7W-(f^^C3>A^j^n/c 

3- F**<fcO?SiStcSI?fShSC<h*nJS8CCO, 3- F 

TS. J IT3>;W7(i, Htf5JfiB&3- F** 

[0 0 0 8 ] A-f hn - FOftfflft ^l^^yAV^ 

^*#{Ctt*Pa*3g (intermediate 30 
representation I R) i LT^06tiT 
l^SV-^^-f F3- F^<Dffl!R*i&gi-f So I R 

^ay^A(D2o(DS*5K, fiPfe. SUiffl7 0-y 
^7 (CFG) isiVf-^P-^? (DFG) CC 
flUt" Sfi#K£^x.£o I R«3>ys-r^CCBBO 

[0 0 0 9 ] CFGK3- FSr^BfSnSCiO&l^ 

dfgb. a&^sti&mt^tib&tiimsti&ffi® 

4>S. I R<D/c»tCD FG*SfeJ5R-rS/cto. r-£:7P 
[ 0 0 1 0 ] x> F3.-lf^3>^-^;U$n^c> f, n>/-7 50 
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i T3>^v7toasi^«raaffcrsfcd&ofi535c 

«, S^'^ftfiftSi J I T3>/^^<DiIg<h#jsg 
**afbfS I R£Jf^TS/c#V~;^V ha-F* 
6DFG£^TS#ffi£tS{*T£t<tr&£ 0 

[0011] 

[BH*S&TSfc#©3M5] #IHJJ«. I R<Dtctb<D 
D F G©^(Dja^*3Jc#-r £ C t tC <fc o r + ^£ ^ 

X7 r -fiKDfta^t«:rolW6tl"Ct»S. $%0J3<Dgll<D 

8S««cJ:ft«, hv-X7r^^60«3nAj: 

F3— F>^3 ^ ^<DiWfaIX* + >tt, 4*W£*T, 
ttC^D ? ^rtOS^V h 3- FOBB*&*l5l3ETSfcft 

a ^ ZltKD&'H b 3- FCCOUT, DFG-/-F* S £. 
J5R3*i, :/p ? ^fttfVW F3- FfcJr^T&flBSfiS 

So 

[0012] *^©^20S8*5«:j:n«, h3- 

7s$y ^=KS8a*&3-- f^p v ^cc^CirafiSSia 
So CFGtCctor^snsssnA^T 5 — h^>Ftc 

ot^-C. C FG<D 1 o<D^X;!>^£ii$ft£o ^CC, DF 

^'P ^^c^ff^r^rco^P -y ^^Olit, IhID^^*^ 
^tfcSSflBfSrff^rs^^^ F3 - F(Dfc«)COy>^*sit 

¥^ns 0 

[0013] 

[ooi4] m i © (a) so' (b> *#e-ts 

i^ + ^^^V h3- Fv-*7 r -Y;K0Sa^fc<0fflB5 
ft7P-^^r ^^A 1 0*sJ:O*2 O^H7SS4art» 
S„ ffl*©7D-y^7y7A 1 0i2 OtCfc^Tfc, 
F 7 7 2 tii/ + ^^3>yv^7 1 

4CC j;ot + F3-F^7^7 7 ^H 6^C3 

>/W;U$nSo 7P-^W7^7A10t« 4 
7? -T;U«S AO 1 8 CCfe^riaeS*!. 7D-^7 
^7A2 Orti. ^7^77 ^bi.ii/tA'7>^-(A^ 

^rA2 2^:iSUr J I T 3>/nV ^ 2 4 CCgSnS. 
^t^SAO!8B. A* 7 r ^£151 DiSff* 



5 

Hfr-r<&;W Fa- K*#tf^ + ^<^^^7T-r;U2 6 

*«a^b3tifc*fficct5c»r©*a*-r4. jit24 

[0015] A*'W Fa- FV^^7X7 7 >fJKD 
ABIKfc^T. SAO<h J I TomfiWl R*fcj&-r 

±IBL/c$P<, IRttCFGiDFG^P^ 
ACDfrJW* Jrtff*- * 7 n -««©ffl*r e8S5tt*S 
{b-C$>-5) 0 CFGtt^oy^Afl!)||?f<DA:tf)(cy>*3 

-F^o-y^tt, *KSnSc(,»yjI/-^iL/racc||?f 
Sn5^-f ha- FOE^J«5T , *f3, 

[0016] Fa- F©^7^77^M 0 

^ 6 CO C F G <D %J$L & 9jk f V U - ¥ 4 T V ^ A V * £ . 

c f G<D£s5Wffi& *«®&n 6 ti r ^ s c i Passat ?n^> 

SrJIH#3&^W6n/dB)*{W^-f Fa-F-OXh-5 
^ya>^>X?:^i'7X77^t'4 0£fl3t>-C 
WteShS. l*»6N(DIIB#tt. S&!MC*5C>X*n5ft 
"Cl»4J:5tC, a>t*ju-£ 7'D^7 ACDtT^S^tttt 

7>fJK0«IT^> o 

[o o i 7 ] ffl'&yvi-mm, CFG4 4fCcfcori^ 

^ntt^ 0 CFG4 4&*, ^7X7 7 -{il/ 

3«:ft»]«:||fTO. >f>XF7^^3>100^610 
3*^CCj|tfU. ^>^h7^y 3 >10^614XB 
1 1 9(Dl»-rn^«rHtf 0, fif£fc^>*F5* 

><D*tfOBJCC||?fS*iS^*Ci*}g7nr4. CFG 
4 4ii*«c5Wr**3, S3SftH42CcJ:^r^^^^ 

[0 0 1 8 ] ±IZ<D#n<, CFG#— B£]£3*i4£, 
DFG^£ffiS<i5. #«9JK:J:htf. DFG<D2o(D 
S8»3&«CFG*>6*RE3ti4. lo<DS8«rttfc<. 

MKftj a v a ^7 AKfe^r a e»ht ^ s 
catena- vfvi-j ^i*3tc*siir**-r5^*<briS® 
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©Sa»**SfS«:B»3ti, *ti6(Dffi*«DFGrt6C*JC» 
T*ft6*/B^£-'*-Y ha- FCtit^tcy >^3h£c 

[0019] 5tffa«iBSttO. 

BUR©* * v *ttK^©a sseersfc 

»SfflSn/c£B&^7 : ---S/*F!l7SL/rc»5. Xf-y 

6 0 4>HL Fa- F6 2 ©A*^-^ 

v 2 6 4CD**^-l>3>*£fi£ 
-TZtctbtfCis* a U- F £h£ 0 6 4"CjftS*iS^* y 
Fa- F6 2<DA*i'-4 r >>«0— S5^6 6 CD 

h. POPY>^h7^*>3^7 0*«jfetfTSBIPU 
SH "2" -Y>X F7^^>3>7 2CCj:orffl "2" <h 

7 4 £7 6tCj;otI$n^DFGy- FrtfctMftSft 

[0 0 2 0 ] ##69!<D#* u^isswc «tft«, F 
7'J-ffljn-5 fl ;W ha- F^7X77 -YjUcJ: 

£ft£ 0 C©y- F«4i»ltcttSJ:5K:, Cft£<E<D£ 
fifcb <fc *c U > £ £ n 5 c £ tc <£ T y V -<oflfc 

[0 0 2 1 ] l^4CC^o-C. *»qaccj:5, ha- 
F^n * ^rttefcWS »J>^<D£^ffi<D:7a-^7 

•Y ha- FV-X77 -Y;H 0 2CDfuii77f^X+^> 

&*D<DftnOW ha- Ft-*Stt5SSX*«3 ^^CD 
-C,^WFa-F112, 1 1 4feiO'l 1 6©HJS 
(^^-h) 1 0 6, 1 0 8fcJ:^l lO^^^T 

40 [0 02 2] yccc, Xf77'l 1 8tCfcl>T, ^'-Y h a 
- FV-^77 A )\* 1 0 2<oailj&rSl^* + >A^?f 3 
hS. g^*t>Xf-7^1 lStii, D F G(DS 

- F^ne^i-ratcufc^orfejasna. CTx. 

tt, Fa- F 1 16*W>^h7^^a>P0Pr 
Ah«. POP^8R6hSi/- F 1 2 O^POPY 
>Xh7^y 3 >l 2 2*StfDFGrtCC*JSShS. 
£Ct. ^>^h7^y 3 >BI PUSH "2" h 
a- F 1 0 8-CKHX6*i££. 12 4JCfeli"C, fi 
"2" ^hSr^ffltSPOP ^>^h^^^>3>12 
50 2(C'J>^§ftS a 
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[0 0 2 3 ] #169! tt, X £ v ^<Da tf-0«HB±»© 

sttBfcJWnfc^fcS. **WB, ha- FO* h y 

£>£<Dr, ;W ha- F-Y>x h^^^ 3 >tcj:- 3 rr8 
»SttS*^5>F**t»ttffl*W>* bZirtsa^tc 

7v7&ju ha- F3&*BE*nfc». e^icj^f$n 

10 

[0 0 2 4] ^(**/7^--rSSH'CBftl,»Ai. MttS 
ffl#*fSWc«fcS;<-f Fra-F-f h-5 £^>a 
*feJ«Stl4DFGrt<0y- Ktcot^r, 05 (A) £ 

(B) ecH^snrc^-s. 05 (a) £ (B) b, ^ 

&<DV>2 *n-o1c 4 >X h ^ 2 is 3 y -x©w 

[0 02 5 ] 05 (A) tCfctit, ;^h3 - Fv-X 

-Yha-F15 2, 1 5 4*5 J: CM 5 8CC#W6ftS. 20 

» ( I ADD) -T>X h^^>3>#;W ha- F 1 5 
8#>6KK6ttS££fe«:. K 1 6 0 #D F GftCC 
fe!c?n5o -f>X F7^ 3 > I ADDOSS^^ 
-F 1 6 0tt2-3(Dffl*?g«-f 4Ci3&i*a6tirc>S 8 

ha- h* 1 5 4 £ 1 5 2^6ScBX6tl. DFGCD^-F 

yy-rtoy- F l 6 2 £ l 6 4 y- 
F i 6 occy >^$hs. 

[0 0 2 6 ] 05 (B) tcfeor. ha- FV-X 30 
^P^16 6«HJil*rSlX+ + >CC<fc^r^<-f ha- 
F 1 6 8 , 17 0teJ:CM 7 2 tc*TE^3liS a 
ha — F 1 6 6<Di/—tr2sX1tZX'T-~ h^ > h (X + 
X) Sr^Sfi-TS. aii^'[fiIX* + >X7 L ^^r« % ftW 
tCffl^fc^V ha — Fl 7 2 B I A D D -Y >X h ^ ^ 

^3>r&& 0 H5 (A) cDWcKiurKHjofcJ:^ 

tC, IADDY>^ h ^£*> 3 XiD FGrttCfcC^r 2 
rxofflioy >^4}#ofc-/- K 1 7 4*iJSt5. 'JR 
<0*4 ha - F 1 7 (DUP) ^>*h^^> 

3>-C£>£CDr, F 1 7 4frtb<Dmji<DV >i?BD 40 
UPY>^ h^^> a >>- F 1 7 6tC*fUrft 

^n^ 0 ^c. FX ( I LOADX) -OX h 

F 1 7 6«:*f-r*y >^*f*ofcy- F 1 7 8#±J&3 

f^ y-tcfewsccD^-^^^tcfcc^r. 

y- F 1 7 8 0ffi«y- F 1 7 6(Cffl^S4i % M^Ofil 
B^ - F 1 7 4CCfei,>T?SSt£ft£. ±5C}8tffcCT^6 

HtT*4l«. S JSS ftfc1#3g© D F GttC F GCC i -j t 
4Sol7/c7'a * #rtc*vw ha- F<D^--2r>xK:te 50 
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[0 0 27]*«W©»2<O»«CCj:tl«, CFGrtOD 

h*-F7uv 2m<D&oBmifi^$ti2>. me 

By >££ft/ca- F2 0 0, 2 02, 2 0 4*^0*2 
0 6<DS»©:/o? ^*^fcCFGl 8 O^TnLTU 
5o C F G 1 8 0 r B^ 3^>(join) 208iC5fe 
tf-T^^n **202£20B KWCftS £*jt6h 

*^«f:^6n^^-C^^ CFG#£j££ti££, 
X^7^i2 10, 2 12, 2 14fe<ttf2i6^ 
•Yha-F2 0 0, 202, 2 0 4fc«fcO*2 0 6 OD&T' 

5? ? ^CCHl^-ra^D v#ft<D'H ha- FWHtf Sii 

£ y 2>±<Dffitt^-Y h a- F^n v t>fo<D-{>X Yvf 

is 3 >cc j: -^r^sti* it^ci *ssa-r^#-ca 

£ 0 C<DC£B, W>X h ^^>3>2 1 8 £ 2 2 0 CO 
X* ? £tfc®2 2 2 £ 2 2 A^<D^<0^y f>yfcj: 

XF7^>a>2 1 8 B PUSH "1" "C#> 9 , -Y>X 
h7^>a>2 20ttPUSH "O" t*^ Y>^F 

7^i/3>2 2 6ttPOPt*4. 
[0 0 2 8] ha- F<D*&a><D^Ci v ^rtCt&SS-' 

tifcffl*s^-Y ha- F(D^a v ^fflCcfc^^r D F Grt© 

y>^*^*r4fc»tc^-r ha - FOtl^< 

CCDWfRB^V ha - FO?g»«!;^a<0y- Fffl©';> 
i'iUtDFGfitcSSShS. iTOWftT'D^ 

( i terat ing) CC tfc?? U T I > fc. Witt, CF 

g i 8 orto^u ^ ^RflcDy >^cco^r dfg^m 

■TS/cefo. SlgSM7 J D^2 0 0, 2 0 2*5^2 
0 4*dfeSB8*iiL/-Cffe)*i. ^^r, »2eBS^^ 
D^2 0 0, 2 0 6fc<fcO'2 0 4*$tagg&£ffiLT 

[0 0 2 9]*^tcJ:ntt. ^Wha-FWf^a 
^PbWJ >^CC-^c^TD FG^Mf^/c^c^t© 

0, 2 0 2. 2 0 4*$cfcO*2 0 6 PdKDD F Grt<D y >^ 
B, 7'o-^2 0 0, 2 0 2, 2 0 4£i!^r^i!U 
*<D®, 7-D y 2 2 0 6frc*f-T5 y yt>Z®±LTZ>tc£> 
tc. ^O7>2 00, 2 0 2. 2 0 44iiLfcSK*6 

>? 1- v ^ommm^io <k o'±is o /c^!i«aawcc: y * - 



(6) 

9 

[0 0 3 0] Wx.K* CFG1 8 OCCfc^T, y 3 -f> 
2 0 8CC5tfT-rSXdr „^ttJ82 1 2 i 2 1 6©Pf#tt 

nv*2 0 6rt<D*Sffl©mBi 3 Mf«cfftttoni. * 
* * v *ttSBffiiE2 2 2 Ccfcfefrrsfifi 
*S3-F^P^^2 0 4GD2 2 6«CtoW4^-r h3- F 

a - F-f h7>^3>22 6tt*****£tefiHB2 
2 4K#j£T£ffi£&<£fflir£. io 

[0031] Fn- F (pseudocode) 

®«T<Dtf tt**inLr 1*5. feL ( I F) (M63&>© 
U/c 6) 

X= 1 : 

^■<Dffe(ELSE) 
X= 0 ; 



A = X+2 

[0 0 3 2]S"A = X+2 M CCfcWS "A" £ 20 

ffi<z>tt#Ktt. - 1 f (ra6^©^#2S£-rn«) M 

[0 0 3 3 ] 07 (A) CCfcOT. *»B J B<DS2©S8«l 
CCO/c^orDFG^^Jffi-rSfcfeCCfigffiStifc^f'y 

^f9^2 5 OtCfc^T, CFG^:7<D^l<E>:/^> 
**sx* + >$h£. H6CDCFG1 8 0 CC®fflT£ 
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This application is submitted in the name of inventor Sanjoy Ghosh, assignor to Snn 
Microsystems, Inc., a California Corporation. 

SEECIEICATXGE 

METHOD FOR GENERATING A 
5 JAVA BYTECODE DATA FLOW GRAPH 

BACKGROUND OF THE INVENTION 

L Field of the Invention 

The present invention relates to the optimization of Java class files. More 
particularly, the present invention relates to the generation of aspects of a data flow 
10 graph from a Java classfile optimizer that identifies the propagation of data values in 
blocks of bytecode and between blocks of bytecode. 

2. The Prior Art 

A known problem for software developers and computer users is the lack of 
portability of software across operating system platforms. Attempts to address this 
1 5 problem must include means of ensuring security as computers are linked together in 

ever expanding networks, such as the World Wide Web. As a response to both of these 
concerns, the JAVA programming language was developed at Sim Microsystems as a 
platform independent, object oriented computer language designed to include several 
layers of security protection. 
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lava achieves its operating system independence by being both a compiled and 
interpreted language. First, Java source code, which consists of Java classfiles, is 
compiled into a generic intermediate format called Java bytecode. Java's bytecodes 
consist of a sequence of single byte opcodes, each of which identify a particular 
5 operation to be carried out Additionally, some of the opcodes have parameters. For 
example, opcode number 21, Hood <vamum> t takes the single-word integer value 
stored in the local variable, varnum, and pushes it onto a stack. 

Next, the bytecodes are interpreted by a Java Virtual Machine (JVM) which " 
translates the bytecodes into native machine code. The JVM is a stacked-based 
implementation of a "virtual" processor that shares many characteristics with physical 
microprocessors. The bytecodes executed by the JVM are essentially a machine 
instruction set, and as will be appreciated by those of ordinary skill in the art, arc similar 
to the assembly language of a computing machine. Accordingly, every hardware 
platform or operating system may have a unique implementation of the JVM, called a 
Java Runtime System, to route the universal bytecode calls to the underlying native 
system. 

When developing the JAVA bytecode instruction set, the designers sought to 
ensure that it was simple enough for hardware optimization and also included 
verification means to provide security protection and to prevent the execution errors or 
20 system crashes that can result from improperly formatted bytecode. As Java's bytecodes 
contain significant type information, the verification means are able to do extensive type 
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checking when the bytecodes are first retrieved from the internet or a local disk. As a 
result, the interpreter of the native machine need only perform minimal type checking at 
run time. Unlike languages such as SmallTalk that provide protection by performing 
extensive runtime checks, Java executes more quickly at run time; 

5 Although Java provides security through verification means and portability 

through bytecodes, Java programs lag natively compiled programs, written in languages 
like C/C++, in their execution time. When a user activates a Java program on a Web 
Page, the user must wait not only for the program to download but also to be 
interpreted. To improve Java's execution time, optimizations can be introduced into the 
1 0 processing of Java bytecodes. These optimizations can be implemented in a variety of 
maimers including as Stand-Alone Optimizers (SAO) or as part of Just-in-Time (JTT) 
compilers. 

A SAO transforms an input classfile containing bytecode into an output classfile 
containing bytecodes that more efficiently perform the same operations. A JIT 

1 5 transforms an input classfile containing bytecode into an executable program. Prior to 
the development of JTTs, a JVM would step through all the bytecode instructions in a 
program and mechanically perform the native code calls. With a JIT compiler, however, 
the JVM first makes a call to the JTT which compiles the instructions into native code 
that is then run directly on the native operating system. The JTT compiler permits 

20 natively complied code to run faster and makes it so that the code only needs to be 

compiled once. Further, JTT compilers offer a stage at which the executable code can be 

3 
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optimized. 

To either optimi2e or compile bytecodes involves the translation of the source 
bytecodes into what is known in the art as an intermediate representation (IR). The IR 
provides information about two essential components of a program: the control flow 
5 graph (CFG) and the data flow graph (DFG). Subsequently, the IR is transformed for 
compilers into object code and for optimizers into an improved version of the source 
code format. 

The CFG breaks the code into blocks of bytecode that are always performed as 
an uninterrupted group and establishes the connections that link the blocks together. 
The DFG maps the connections between where values are produced and where they are 
used. This includes connections within blocks of bytecodes and also connections 
between blocks of bytecodes. Because Java programs frequently use stack locations as 
implicit variables, tracking the propagation of values is a difficult process. When 
extracting the data flow information to generate the DFG for the IR, the prior art 
approach has been to perform a complete simulation of both the stack and iterations 
through all possible flow paths . 

In contrast to traditional compilers where end users only interact with compiled 
programs, the run time nature of Java creates a significant incentive to optimize the 
speed and efficiency of Java optimizers and JIT compilers. Accordingly, it is an object of 
20 the present invention to provide a method for generating a DFG from the source 
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by tecodes to form an IR that optimizes the speed and efficiency of Java optimizers and 
JIT compilers. 

. These and many other objects and advantages of the present invention will 
become apparent to those of ordinary skill in the art from a consideration of the 
5 drawings and ensuing description of the invention. 

STWTMARYOFTHE INVENTION 
The present invention is directed to the optimization of Java class files by 
improving the efficiency of generating a DFG for the IR. 

^ 0 According to a first aspect of the present invention, a forward.scan of an 

uninterrupted bytecode block from a by tecode source file is performed to identify the 
start of each bytecode in the uninterrupted block. Next, a backwards scan is performed. 
For each bytecode in the block, a DFG node is created, links to all the values used by the 
bytecode in the block are established, and links to preceding or subsequent nodes in the 

1 5 file are created. 

According to a second aspect of the invention, the bytecode blocks are linked in 
the file's order of execution and a stack state is generated for each block of code. For 
each join statement generated by the CFG, one path of the CFG is stepped through. 
Next, links are established in the DFG between locations where values are used and 
20 where the values are produced. Finally, for all blocks parallel to the blocks producing 

5 
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values the links for bytecodes occupying the same stack state location are duplicated. 

BRIEF DESCRIPTION OF THE DRAWINGS 

iFIG. 1 A is a flow diagram of the optimization of a JAVA bytecode source file 
using a stand alone optimizer according to the present invention: 

5 FIG. IB is a flow diagram of the optimisation of JAVA bytecode source file using 

a just in time compiler according to the present invention. 

FIG. 2 is a flow diagram illustrating the generation of control flow graph from a 
bytecode source file suitable for use in the present invention. 

FIG. 3 is a flow diagram of the generation of data flow graph according to the 
1 0 prior art 

FIG. 4 is a flow diagram of a first aspect of the generation of a data flow graph for 
the Knlc* in a block of bytecode according to the present invention. 

FIG. 5A is a block diagram for the generation of a node in a data flow graph for 
the instruction IADD according to the first aspect of the present invention. 

1 5 FIG. 5B is a block diagram for the generation of several nodes in a data flow 

graph for a plurality of instructions according to the first aspect of the present invention. 

6 
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FIG. 6 is a flow diagram of a control flow graph suitable for illustrating a second 
aspect of the generation of a data flow graph for the links between blocks of by tecode 
according to the present invention. 

FIG. 7A is a. block diagram of the steps in the method according to the second 
aspect of the present invention. 

FIG. 7B is a block diagram for the generation of nodes in a data flow graph 
according to the second aspect of the present invention. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
Those of ordinary skill in the art will realize that the following description of the 
present invention is illustrative only and is not intended to be in any way limiting. Other 
embodiments of the invention will readily suggest themselves to such skilled persons 
from an examination of the within disclosure. 

Referring first to FIGS. 1 A and IB, simplified flow diagrams 10 and 20 of the 
optimization of a Java by tecode source file are illustrated. In both flow diagrams 10 and 
20, Java source code file 12 is compiled by a Java compiler 14 into Java bytecode 
classfiles 16. In flow diagram 10, the classfiles arevoperated on by a SAO 18, and in flow 
diagram 20, the classfiles are passed through a Java Runtime System 22 to a JIT compiler 
24. The Java SAO 18 outputs Java classfiles 26 containing bytecodes that perform the 
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same operations as the input classfiles, only in an optimized manner. The JIT 24 outputs 
.executable code 28 that can run directly on the native operating system 30. Those of 
ordinary skill in the art will recognize that the procedures described in FIG. 1 are merely 
illustrative and that there exist other structures within which classfiles may be optimized. 

5 In processing an input bytecode source classfile, both an SAO and a JTT form an 

ER. As described above, an IR is a succinct and straightforward structuring of the 
control and data flow information of a program into a CFG and DFG. The CFG 
represents the information regarding the sequence and order of the execution of the 
bytecode instructions as blocks of code linked in the order of execution of the program. 
1 0 The blocks of code are sections of bytecode that are always performed as an 

uninterrupted group, and the links between the blocks of code are bytecode branch, 
conditional, or jump instructions. 

FIG. 2 is a flow diagram that illustrates the generation of the CFG from a 
bytecode classfile 40. It should be appreciated that there are several known methods for 

1 5 generating a CFG. The specifics of any such method will not be disclosed herein to 
avoid overcomplicating the disclosure and thereby obscuring the present invention. 
The procedure begins with a classfile 40 containing an ordered sequence of bytecode 
instructions numbered from 1 through N. The numbers 1 through N are analogous to the 
numbering of lines in a computer program as is known in the art. Next, an optimizer 42 

20 processes the bytecode classfile 40 to extract the control flow information from the 
bytecodes. 
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The control flow information is illustrated by the CFG 44. For example, the CFG 
44 may indicate that for classfile 40 that bytecode instructions 1 through 3 are first 
executed, instructions 100 through 103 are next executed, either instructions 10 through 
14 or 15 through 19 are next executed, and finally instructions 4 through 8 are 
5 executed before the remainder of the bytecode instructions execute. It will be 

appreciated that the CFG 44 is an example only, and that any number of CFGs may be 
generated from the classfile 40 by the optimizer 42. 

As described above, once the CFG has been generated, the DFG will be 
generated. According to the present invention, two aspects of the DFG are generated 

1 0 from the CFG. In one aspect, the propagation of values within blocks of bytecode is 
determined. As is known in a typical JAVA program, data values are produced in one 
location and consumed in another. The first aspect of the present invention relates to 
establishing the connections between the production and consumption points that 
occur within uninterrupted blocks of code. The present invention allows implicit 

1 5 variables to be easily recognized and for their values to be immediately linked in the 
DFG to the bytccodc(s) using them. 

Unlike the prior art, the present invention avoids the approach taken in the prior 
art of constructing a stack to determine the values used by bytecodes. For example, in 
FIG. 3, a block diagram of a known method for generating links within a code block of a 
20 CFG is illustrated. The diagram demonstrates the principle stages employed to determine 
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the value of implicit stack location variables when generating a DFG. During stage 60, 
the operations of an input sequence of by tecodes 62 are simulated to create a version of 
the stack 64 that they represent The stack shown 64 presents only a portion 66 of the 
input sequence of bytecodes 62. During stage 68, the stack 64 is executed, and it is 
5 determined that the POP instruction 70 is associated with the value "2" due to the 
preceding BIPUSH "2" 72 instruction. This information is stored in a DFG node 
represented by reference numerals 74 and 76. 

According to rhe preferred embodiment of the present invention, a node tree is 
employed When values are produced by the bytecode classfile, a new node is created 
10 in the tree. This node is then connected to the rest of the tree by linking it to the value's 
production and consumption points, as they become known. In this manner, the 
propagation of data values through a program can be readily traced. Those of ordinary 
skill in the art will recognize that a variety of data structures could be used. 

Turning now to FIG. 4, a flow diagram of a method for generating links within a 
1 5 bytecode block according to the present invention is illustrated. In a first step 100, a 
forward scan of a bytecode source file 102 is performed in order to determine the starts 
106, 108, and 110 of the bytecodes 112, 114, and 116, respectively, because as is known, 
bytecodes can be of different lengths. Next, at step 118 a backward scan of the 
bytecode source file 102 is performed. During the backward scan step 118, the nodes of 
20 the DFG are generated as they occur. For example, if bytecode 1 16 is the instruction 
POP, as POP is read, a node 120 is created in the DFG including a POP instruction 122. 
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If next, the instruction BEPUSH "2" is read at bytecode 108, 
linked to the POP instruction 122 that consumes it. 

The present invention provides a significant increase in the efficiency in the 
generation of the DFG because the entire process of constructing a copy of the stack 
5 has been eliminated. The invention recognizes that because the stream of bytecodes are 
the instructions for a stack machine, the operands or values consumed by a bytecode 
instruction precede the instruction. When the bytecode file is read in reverse order, the 
step of creating a node in the DFG can be immediately performed after the bytecode is 
read. 

1 0 Further examples, not intended to be exhaustive, are illustrated in FIGS. 5 A and 

5B of the nodes in a DFG created from bytecode instructions according to the present 
invention- FIGS. 5 A and 5B respectively are examples of a single instruction with two 
links, and a series of instructions with multiple links. 

In FIG. 5 A, a bytecode source block 150 has been first differentiated by a forward 
1 5 scan into bytecodes 152, 154 and 158. When the backward scan step is performed, an 
integer addition (IAJ)D) instruction is read from bytecode 158, and a node 160 is created 
in the DFG. From the definition of the instruction IADD it is known that node 160 will 
consume two values. As the backward scan continues, these two values are read from 
bytecodes 154 and 152, created as nodes 162 and 164 in the node tree of the DFG, and 
20 then linked to node 160. 

11 



SUNP-2412 
the value "2" at 124 is 



ftfflW- 1 1 -2 42 5 9 7 



W^'v 16 2 8 8 0 

SUNP-2412 

In FIG. 5B, a bytecode source block 166 has been first differentiated by a forward 
scan into bytecodes 168, 170 and 172. The sequence of bytecodes 166 implements the 
statement (X + X). In the backwards scan step the first bytecode 172 encountered is an 
IADD instruction. As explained with respect to the example of FIG. 5 A, the IADD 
5 instruction generates a node 174 in the DFG with links to two values. Since the next 
bytecode 170 is a duplicate (DUP) instruction both of the links from node 174 are made 
to the node 176 for the DUP instruction. Next, bytecode 168 having an integer load X 
(ILOADX) instruction is read, and a node 178 with a link to node 176 is created. In this 
sequence hi/the node tree, the value in node 178, will be duplicated in node 176, and 
1 0 both values will be consumed in node 174. As will be appreciated by those of ordinary 
skill in the art from the above recited examples, when performing the backwards scan 
step, the particular DFG created will depend upon the sequence of bytecodes in the 
blocks generated by the CFG. 



According to a second aspect of the present invention, the propagation of values 
1 5 between blocks of bytecode in the CFG is determined. FIG. 6 illustrates a CFG 180 
having several blocks of linked code 200, 202, 204, and 206, In the CFG 180, the 
blocks 202 and 206, that precede a join 208 are considered parallel. It should be 
appreciated that a join may connect a plurality of parallel blocks. When the CFG is 
generated, stack states, 210, 212, 214, and 216 are created for each block of bytecode 
20 200, 202, 204, and 206, respectively. .The stack states includes all the values still present 
on the stack after the bytecodes in the block associated with that stack have been 
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executed. It should be appreciated that values on the stack are produced by 
instructions in the bytecode blocks. This is shown by the mapping of the instructions 
218 and 220 to the stack states 222 and 224, respectively. For purposes of illustration 
to be described below, instruction 218 is PUSH "1". instruction 220 is PUSH "0". and 
5 instruction 226 is POP. 

To determine when a value produced in an early block of bytecode is used or 
consumed by a subsequent block of bytecode to generate a link in the DFG between 
blocks of bytecode, an execution of the blocks is made while noting the consumption of. 
a prior stack state value and identifying its source. This information is represented in the 
DFG as a link between the nodes of the consumption and production of the bytecodes. 
To establish links for ail parallel blocks, The method in the prior art relied upon iterating 
through all possible control flow paths in the CFG. For example, to generate the DFG 
for links between blocks in the CFG 180, a first pass would be made through the path 
including blocks 200, 202 and 204, and then a second pass would be made through the 
path including blocks 200, 206 and 204. . 

According to the present invention, not all control flow paths must be traversed 
to form the DFG for links between parallel blocks of bytecode. Rather, the links in the 
DFG between blocks 200, 202, 204 and 206 can be generated simply by stepping 
20 through blocks 200, 202, 204 and then applying the information from the pass through 
blocks 200, 202, 204 to establish the Links to block 206. The present invention exploits 
the fact that Java standards 1 require and the verification described above ensures that in 
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properly formatted code, the stack states for any of the blocks preceding a join are the 
same. 

For example, in CFG 1 80, both of the stack states 212 and 216 that precede the 
join 208 must be the same in both size and relative type. During execution, the 
5 consumption of any value in block 202 will be paralleled by a consumption of a value in 
block 206, As a result, for example, if the value corresponding to the suck state location 
222 is used by the bytecode instructions 226 of code block 204, then the bytecode 
instruction 226 also uses the value corresponding to the stack state location 224. 



The following lines of pseudocode illustrate the point. 
1 0 EF (some condition occurs) 

X=l; 

ELSE 

X = 0; 

15 

» 

A = X-h2; 

The assignment of a value to "A" in the expression M A = X + 2" uses the value from 
either, expression "X = 1" or X = 0" that has been assigned to the implicit stack 
20 variable "X" as determined by the "IF (some condition occurs)" statement. 
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In FIG, 7A, a block diagram outlining the steps employed to generate a DFG 
according to the second aspect of the present invention is illustrated. 

At step 250, a first branch of a CFG graph is scanned. Applied to CFG 180 in 
FIG. 6, a first pass / through blocks 200, 202 and 204 is performed 

5 Next at step 252, links between the place a value is created and the place a value 

is consumed are made. As illustrated in FIG..7B, a link 254 is made between the 
PUSH 44 1" instruction at location 218 in the CFG 180 that the POP instruction consumes 
at location 226 in the CFG 180. 

Next at step 256, a link 258 is made between the PUSH 0 instruction at location 
1 0 220 that the POP instruction consumes at location 226, because it is known that the 

data for this instruction occupies the same respective stack state location as the PUSH 1 
instruction at location 218 in the CFG 180. As such, without traversing all paths in the 
CFG as relied upon by the prior art, according to the second aspect of the present 
invention, the Knlcs in a DFG between blocks of bytecodes can be more efficiently made. 
15 As joins are quite common throughout bytecode source code, the present invention 

represents a significant increase in efficiency. Where multiple joins exist, the increase in 
efficiency is even greater. 

While illustrative embodiments and applications of this invention have been 
shown and described, it would be apparent to those skilled in the art that many more 
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modifications than have been mentioned above are possible without departing from the 
inventive concepts set forth herein. The invention, therefore, is not to be limited except 
in the spirit of the appended claims. 
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CLAIMS 

What is claimed is: 

1. A method for linking bytecodes of an uninterrupted block of bytecodes in the 
formation of a data flow graph comprising the steps of; 

5 scanning said uninterrupted block of bytecodes in a forward manner to identify 

the start of each of said bytecodes; 

scanning in a backward manner bytecodewise each of said bytecodes in said 
uninterrupted block of bytecodes; and 

generating a link in said data flow graph that links each of said bytecodes to all 
1 0 other of said bytecodes used by said each of said bytecodes. 

2. A method for linking byteccdes between uninterrupted blocks of bytecodes in 
the formation of a data flow graph, said uninterrupted blocks of bytecodes having links 
according to an order of execution of said uninterrupted blocks and wherein a stack 
state has been generated for each of said uninterrupted blocks of bytecodes, comprising 

15 the steps of: 

stepping through a first path of a plurality of paths of said order of execution that 
terminates in a join to generate a link in said data flow graph between each bytecode 
producing a value in one of said uninterrupted blocks and each bytecode consuming 
said value in another of said uninterrupted blocks in said first path; and 
20 duplicating each link in said first path with a link for each bytecode in all of said 

17 
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plurality of paths other than said first path for each bytecode producing a value having 
a similar stack location to each bytecode producing a value in one of said uninterrupted 
blocks in said first path. 
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Step One : Scan one branch of control flow graph 
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Step Two : Establish links between where a plurality of values are produced 
and where the piuralty of values consumed 
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Step Three : Duplicate 8nk($) to consumption point(s) from aO production 

potni(s) that have the same respective stock stale 
locations) as production points in step two.. 
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ABSTRACT OF THE DISCLOSURE 
According to a first aspect of the present invention, a method for linking 
bytecodes of an uninterrupted block of bytecodes in the formation of a data flow graph 
comprises the steps of scanning the uninterrupted block of bytecodes in a forward 
5 manner to identify the start of each of the bytecodes, scanning in a backward manner 
- bytecodewise each of the bytecodes in the uninterrupted block of bytecodes, and 
generating a link in the data flow graph that links each of the bytecodes to all other of 
the bytecodes used by the each of the bytecodes. 

. According to a second aspect of the present invention, a method for linking 
1 0 bytecodes between uninterrupted blocks of bytecodes in the formation of a data flow 
graph, the uninterrupted blocks of bytecodes having links according to an order of 
execution of the uninterrupted blocks and wherein a stack state has been generated for 
each of the uninterrupted blocks of bytecodes, comprises the steps of stepping through 
a first path of a plurality of paths of the order of execution that terminates in a join to 
1 5 generate a link in the data flow graph between each bytecode producing a value in one 
of the uninterrupted blocks and each bytecode consuming the value in another of the 
uninterrupted blocks in the first path, and duplicating each link in the first path with a 
link for each bytecode in all of the plurality of paths other than the first path for each 
bytecode producing a value having a similar stack location to each bytecode producing 
20 a value in one of the uninterrupted blocks in the first path. 



